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Abstract 

The purpose of this paper is to extend von Davier, Holland, and Thayer’s (2004b) framework of 
kernel equating so that it can incorporate raw data and traditional equipercentile equating 
methods. One result of this more general framework is that previous equating methodology 
research can be viewed more comprehensively. Another result is that the standard error of 
equated score difference (SEED) has a wider application than originally proposed. The methods 
described in this paper are empirically evaluated in an accompanying simulation study (Moses & 
Holland, 2007). 
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Introduction 

Organizing frameworks for equating methodologies can be helpful for revealing 
relationships among studies of seemingly different scope as well as for generating new equating 
methodologies. For example, von Davier, Holland, and Thayer’s (2004b) recent framework 
related chained and post-stratification/frequency estimation equating approaches in terms of a 
design function. This framework provided a useful way to reconsider the comparisons of chained 
and post-stratification methods that focused on equated score differences (Holland, von Davier, 
Sinharay, & Han, 2006; Livingston, 2004; Livingston, Dorans, & Wright, 1990; MacCann, 1990; 
von Davier, Holland, & Thayer, 2004a; Wang, Lee, Brennan, & Kolen, 2006) and on theoretical 
variability estimates (Liou & Cheng, 1995). In addition, a significance test for chained and post- 
stratification equated score differences was a straightforward development produced within the 
von Davier et al. framework. 

The purpose of this paper is to add generality to von Davier et al.’s (2004b) framework of 
equating. The original framework focused on data that are pre-smoothed using loglinear models. 
This paper describes how raw data can be utilized throughout the steps of the original framework, 
promoting a broader consideration of the impact of smoothing on equating bias and empirical 
variability (Fairbank, 1987; Hanson, 1991; Hanson, Zeng, & Colton, 1994; Livingston, 1993b; 
Skaggs, 2004 ) and on theoretical standard error estimates (Liou & Cheng, 1995; Liou, Cheng, & 
Johnson, 1997). While the original framework focused on kernel equating, this paper shows how 
the traditional equipercentile methods can be incorporated, promoting a reconsideration of 
comparative evaluations of continuization methods (Grant, Zhang, Damiano, & Lonstein, 2006; 
Liou et al., 1997; Liu & Low, 2006; Livingston, 1993a). The result of adding generality to the 
original framework is that the tools developed in von Davier et al. for significance tests of 
equating function differences can be applied to raw data and to traditional equipercentile 
methodologies. This paper focuses on the technical details involved in raw and smoothed data and 
in traditional equipercentile and kernel equating for the nonequivalent groups with anchor test 
(NEAT) design. All of the procedures described in this paper are empirically evaluated in an 
accompanying report (Moses & Holland, 2007). 
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The Five-Step Equating Framework 

The empirical decisions practitioners make to equate tests include a pre-smoothing step 
that estimates true distributions from a set of collected data, an estimation step that calculates the 
probabilities of the test scores used in the equating process, a continuization step that uses the 
discrete scores to determine “in-between” scores, a score equating step, and a standard error 
estimation step. This section discusses these five steps in detail, describing specific applications 
that are relevant to the different equating functions used for the NEAT design. 

Step 1: Presmoothing 

The equating process begins with a decision on how to utilize the raw data that are 
gathered from test administrations. For the NEAT design the data are collected as two samples 
from nonequivalent populations (P and Q ) that take different total tests (X or Y) and one internal 
or external anchor (A in P and A in Q). The raw data take the form of two bivariate test score 
distributions (X, A in P and Y, A in Q). The scores of the bivariate distributions are discrete and 
assumed to be integers. The sampling error of the raw data can reduce the precision of the 
equating function but is itself reducible through the use of smoothing methods. Because of the 
potential for smoothing methods to introduce bias while reducing sampling variability, 
discussions of whether to use smoothed or raw data are typically framed in tenns of a bias- 
variability tradeoff (e.g., Kolen, 1991; Kolen & Brennan, 2004). 

Loglinear models for presmoothing. One useful tool for addressing the bias-variability 
tradeoff in discrete test score distributions is loglinear modeling (Holland & Thayer, 1987, 2000). 
Loglinear modeling has been promoted in test equating as an extremely flexible and variability- 
reducing smoothing technique for discrete test score distributions (Kolen, 1991; Livingston, 
1993b; Hanson, 1991; Rosenbaum & Thayer, 1987). The loglinear modeling framework for 
comparing the fits of several nested models makes it possible to select among strong and weak 
degrees of smoothing using structured significance tests (Agresti, 2002; Bishop, Feinberg & 
Holland, 1975; Habennan, 1974). 

For the NEAT design, two loglinear models are used to fit the two bivariate distributions. 
The following model is for the (X, A) distribution in P. The model for the (Y, A) distribution in Q 
is similar: 
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( 1 ) 


G F 


log.(/>„)=«+Z+ZA.W* + Z 'EPtWteY. 

g=l /=i 


i=i 


A=1 


where Pji is the joint score probability of the score (x,-, a/) (score x y - on test X and score at on test A 

as taken by population P ) and the a and P ’s are free parameters that are estimated in the model¬ 
fitting process. The fitting of this model produces a smoothed bivariate distribution that preserves 
/moments in the marginal distribution of A, //moments in the marginal distribution of A, and a 
number of cross-moments (G < I, F < // ) i n the bivariate XA distribution. Many values can be 
used for /, H, G, and F; larger values preserve more features of the raw data in the model than 
smaller values. 

The estimated variance-covariance matrix of population P’s probabilities ( ) modeled 

with (1) can be used for computing standard errors of kernel equating (von Davier et ah, 2004b). 

An efficient root factorization of is the so-called “C-matrix,” defined as 


: CpCp 


( 2 ) 


where C P is the JL (for an external anchor) by T (where T = I + H + GF in (1)) matrix and C p is 
its transpose. ^' P can be efficiently computed as 


C p =A ^ D A>Q. 


( 3 ) 


The diagonal matrix has entries a//A along its main diagonal. ^r is the total sample size in 
P, and Q is the JL by T orthogonal matrix that comes from the following QR- factorization, 


d „-Vpp 1 


B = QR 


( 4 ) 


Q is a JL by T matrix with orthogonal columns, R is a / by T upper triangular matrix, \fp and P 

are the column vectors of the yjPji ’s and PjP s respectively, and B is the matrix of the score 
functions of Xj and ai from (1) (Holland & Thayer, 1987). A major advantage of the C-matrix is 
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storage efficiency for large bivariate problems. (Note that there are only as many columns in the 
C-matrix as there are parameters in the model.) 

No presmoothing. The equating process can utilize data in their raw form without the 
use of loglinear modeling. The use of raw data and estimates of their sampling variability 
provides an important basis for evaluating the equating functions computed from some possibly 
under-parameterized and biased loglinear models and from other over-parameterized and highly 
variable loglinear models. When raw data are used, the original probabilities are used as 
estimates of the population distribution. The only issue is to estimate a raw data version of the 
C-matrix given in (3). 

To do so, let P = (l/^p) n p denote the vector of sample proportions based on the 
unsmoothed sample frequencies in vector n p, where n p is the JL “column-vectorized” version of 

the original/ by L matrix of bivariate frequencies. It is assumed that n p has the multinomial 
distribution M (Np, p), where Np denotes the total sample size and p is the column vector of 
population probabilities (Bishop et ah, 1975, p. 469). The covariance of P can be computed as 

£ P= (VN P ) ( Dp - pp l ), ( 5 ) 

where D p denotes the diagonal matrix with entries Pji along its main diagonal, and p l denotes the 
transpose of p. In comparison to (3), there is no loglinear presmoothing so that the dimensions of 

C p will be JL by JL rather than JL by T. Note that there is no reduction in the range of 
applications for the raw C-matrix as compared to the smoothed C-matrix currently used in kernel 
equating. 

C p is found as follows: Write in (5) as 

L p = (1 /N p ) 0^(1^ - -\/p "v/p”") D ^, (6) 

where is the diagonal matrix with entries yfpji along its main diagonal, yfp is the (column) 
vector of the \[Pji ’s, and Ijl is a JL by JL identity matrix. Equation (6) is justified by factoring 
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out from the D p and the pp in (5). The “inner” matrix in (6), that is, (I JL 
idempotent so that it equals it own square. Thus, (6) can be expressed as 


PVP )> is 




where = N p D r- (I 


JL 


'/p'/f )D jp =c P c;, 

(V) 

rr 

fp ( d ^-pVp )• 

(8) 


Similar developments of (5) in (7) and (8) have been given in Bishop et al. (1975, pp. 469-473). 
Step 2: Score Distribution Estimation 

The second step of equating requires an estimation of the distributions of the scores that 
are going to be used in the equating from the raw or smoothed data from Step 1. This estimation 
depends on a “design function” (von Davier et al., 2004b). For the NEAT design and observed 
score equating methods, there are two possibilities for utilizing anchor score information. One is 
an A to Y equating that is based on chaining the single-group links computed for A to A in P and 
for A to Y in Q. For chained equating, the needed score distributions are estimated by obtaining 
marginal score distributions for X P , A P , A q and Yq, the probability vectors of which are denoted 
as r P , t P , to, and s q. 

The other possibility is to use the anchor as a stratifying variable to estimate synthetic 
population score distributions on A and Fin P and Q. The synthetic score distributions are then 
used in post-stratification equating, also referred to as frequency estimation equipercentile 
equating (Jarjoura & Kolen, 1985; Kolen & Brennan, 2004). Score probabilities can be estimated 
in the synthetic populations for A and Y as 


\PSE r wP+(l-w)Q ’ r j 


SpSE S wP+(l-w)Q 


z 

/ 

. A = Z 


w + - 


n - w)t t 


Qi 


1 pi 


(\-w) + 


Wt 


PI 


t, 


Pji 

(9) 

■ <l k i. 

(10) 


QI 


where w is a value between 0 and 1 that specifies the proportion of P in the synthetic population, 
where the synthetic population is defined as w P+(\- w )Q, t pi and *qi are the marginal 
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probabilities at anchor score / in populations P and Q , and Pji and c lu are bivariate probabilities 
at scores Xj and A, in population P and at scores Yk and A/ in population Q. 

Raw data complications. With raw data that have not been presmoothed, there is a 
potential for some marginal score probabilities to be zero. This creates problems for two aspects 
of equating. For the traditional equipercentile method, more than one score exists with the same 
cumulative probability, and there is no clear basis for assigning the cumulative probability to any 
one score. For the marginal anchor scores that are used in kernel and traditional equipercentile 

post-stratification equating, division-by-zero problems can occur [ hi and t qi in Equations (9) and 
(10)]. To avoid problems from scores with zero frequencies, many ad hoc rules have been 
proposed (Jarjoura & Kolen, 1986; Kolen & Brennan, 2004; Livingston, 2004). 

The rule proposed in this paper is also ad hoc, but is proposed because it provides a direct 
basis for the estimation of equating errors in Step 5 while controlling for bias in Step 4, 
particularly for the post-stratification method. This rule is to average any marginal frequencies (or 
marginal synthetic probabilities multiplied by the sample size) less than one with the nearest 
lower and higher scores that had frequencies that were one or greater. These averaged frequencies 
and probabilities are then used in the rest of the equating processes. 

For post-stratification, the averaging is done on the marginal anchor scores, and then, to 
preserve the bivariate distribution and make (9) and (10) correctly sum to one, on each of the total 

test score probabilities in the bivariate tables. For example, if hi = 0 so that the marginal anchor 

tpi t ~^~tp3 

probabilities at scores / = 1,2, and 3 are averaged, this means that r is used for hi , 


hi and hi , 


Pll + P, 2 + P ,3 

3 


is used as the bivariate probability for Pn , P12 and Pn , 


Pi, + Pii + Pn 
3 


is used as the bivariate probability for P21 , P22 and P23 , and so forth for all J. 


Because (9) and (10) rely on p and q vectors with averaged frequencies and the variance- 
covariances of p and q (5-8) are computed from the original frequencies, the discussions of (9) 
and (10) to follow in Step 5 and the appendix will notate the p and q that are produced from some 
frequency averaging as P and 9 to distinguish them from the original p and q vectors. Marginal 
probability averaging for the levels of j and k in r and s may also sometimes be necessary because 
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eliminating low frequencies in the column sums of p does not necessarily eliminate low 
frequencies in the row sums of P. 

Step 3: Continuization 

The discrete score distributions of Steps 1 and 2 have to be continuized into continuous 
cumulative density functions (cdfs) with defined inverses. The continuization process is based on 
an assertion of how “in-between” score values are distributed in the discrete distribution. Two 
continuization possibilities treated here are the traditional equipercentile method’s percentile- 
rank method and the kernel method’s kernel density estimation method. There are other 
continuization approaches that are not addressed here (e.g., Wang, 2004). 

Traditional equipercentile continuization. The traditional equipercentile method of 
continuization is based on linearly interpolating between two scores and their cumulative 
percentages. The assumption is that scores and cumulative percentages are uniformly distributed 
between one-unit score intervals so that the cdf (i.e., “percentile rank”; Kolen & Brennan, 2004) 
for any x score, possible or impossible, can be computed by: 

F(x; r) = 0 if x < xi - 'A 

= {x - (xi - l A ))ri if x\ 14 < x < x\ + Vi , 

3 -1 

= ^ f + rj (x - (xj - 'A)) if Xj — Vi<x< xj + Vi, for j = 2,. . . J, 

k =1 


= 1 if x>xj+Vi. (11) 

The equipercentile continuization can be distinguished from the kernel continuization in two 
respects. First, the equipercentile’s degree of continuization is independent of the sample data. 
Second, the equipercentile’s continuization is based on the cumulative probabilities of pairs of 
consecutive scores in the score distribution. 

Kernel continuization. The continuization method for kernel equating is kernel density 
estimation for univariate score distributions. In von Davier et al. (2004b, p. 56), a Gaussian kernel 
is used, with a parameter (h) that can be altered so that the resulting cdfs are more or less 
influenced by the Gaussian kernel: 
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( 12 ) 


F h xfc r ) = 'Z r j®( R jx( x V 


n / x-a x Xj-(l-a x )n x 4 

where K jx( x J , , a x _ J 2 , ,2 , and ( P denotes the standard Gaussian cdf. 

a x n x \a x +h} 

The bandwidth parameter (h x ) can be set by the practitioner. Extremely large (> 1 0<7 X ) bandwidths 
produce cdfs that retain only the mean and variance of the data, causing the equating function to 
resolve into a linear form. Smaller values of h x produce cdfs that retain more of the data, von 
Davier et al. (2004b) recommend a rule for selecting h based on achieving a continuized cdf that 
closely approximates the discrete data (by minimizing the sum of the squared error) while having 
few modes (by avoiding rapid changes in the derivative of the cdf). One computational method 
that works well for meeting these two criteria in selecting h is known as Brent’s method or 
parabolic interpolation (Press, Teukolsky, Vetterling, & Flannery, 1992). 

Step 4: Equating 

Traditional equipercentile and kernel equating functions are implemented by first 
computing cdf values for scores from one distribution and then finding scores on another 
distribution with cdfs that match those of the first (i.e., the inverse df). The equipercentile and 
kernel versions of the cdf computations were described in Step 3. The equipercentile and kernel 
versions of the inverse cdf computations are described here. 

The traditional equipercentile inverse cdf. To compute an inverse cdf using the traditional 

equipercentile method, the percentile rank of one score on a score distribution, F( x j ;r ), is 
located on another score distribution such that G(y u +1/2/s f is the smallest percentile rank at an 

interval boundary score (i.e., - v +/- 1/2 for score intervals of 1) on s that is larger than l 7 ( x f r ). 
The inverse cdf is then computed as 


e y (x j ) = G- , (F(x j ;r);s) = y u -\/2 + 


F(x j ;x)-G(y u _ 1 TM2;%) 
G(y u +1/2; s)-G(y u _ 1 + 1/2; s)' 


(13) 


The kernel inverse cdf. The inverse cdf for the kernel method can be computed using an 
iterative search process, such as Newton’s method. Given a kernel continuized cdf, Fhx( x ji r ), the 
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- 1 ' score with a matching cdf value on s can be found by starting with a G hY (y ;s) that is based 

* / \ * 

on an initial y value that is plausibly close to e y \ x j) . Then y is updated as 

(F h x(Xj;r)-G ltY (y;s)) 


y =y 


8G liY (y;s) 

dy 


. When it is noted that both the derivative in the update’s 


denominator and the difference in percentile ranks in (13)’s denominator are probabilities of y 

scores near e y( x j) , the kernel method’s use of Newton for the inverse cdf computation and the 

inverse cdf computation for the traditional equipercentile method can be understood as being 

** 

almost identical. The Newton update is repeatedly calculated until a y is found such that 

G„(y m >' s ) is acceptably close to ^hx( x j > r ). The result can be expressed as the kernel equating 
function 


v( x j) =G h Y [ F hX ( x J ; r );*). 


(14) 


Equating functions. The chained kernel and equipercentile equating functions can be 
expressed as two cdf matchings brought together based on the four marginal distributions: 

e Y( cE)( x ) = G' I (H(H- 1 (F(x;r p );t p );t Q );s Q ) _ 

(15) 

The post-stratification kernel and equipercentile equating functions are based on matching 
the cdfs from the two synthetic populations: 


^y(pse) ( x ) G (F(x, r ( 




(wP+(l-w)Q)/’ J (wP+(l-w)Q) 


(16) 


Step 5: Standard Error Estimation 

The delta method is frequently used to estimate the extent of sampling variation in the 
equating function (Jarjoura & Kolen, 1985; Liou & Cheng, 1995; Liou et ah, 1997; Lord, 1982; 
von Davier et ah, 2004b). With the delta method the assumption is made that the XtoY equating 
function is continuously differentiable across X, so that equating variability can be estimated 
using the equating function derivatives and the corresponding estimates of variability in the data. 
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A major difference between how the delta method is treated in this paper and how it was 
developed by Jarjoura and Kolen (1986), Liou and Cheng (1995), Liou et al. (1997), and Lord 
(1982), is that the differentiation and variance-covariance estimation used here are in tenns of the 
probabilities at every score level rather than in terms of cumulative probabilities at a limited 
number of score levels. This orientation to score probabilities allows for a very flexible approach 
to equating function and standard error estimation that is straightforward to apply to all major 
equating designs (von Davier et ah, 2004b), to smoothed and raw data, to traditional 
equipercentile and kernel equating functions, and to the estimated variability of equated score 
differences. 

The pieces of the delta method standard errors are described below. They include the 
derivative of the equating function with respect to marginal score probabilities (J e , Steps 3 and 4), 
the derivative of the marginal score probabilities with respect to the original bivariate distribution 
(Jdf, Steps 1 and 2), and the C-matrices of the two estimated distributions (Step 1). The general 
form of the standard error is given as the root sum of squares of the “SEE-vector,” which is the 
product of the derivative vectors and the C-matrices: 

SEE y (x) = ||J e J DF C|| ( 17) 


In (17), 



denotes the Euclidian length (norm) of vector x. 


Equating function derivatives. Kernel and equipercentile equating functions have the 
general form c Y (x;r ,s) = G 1 (F (x;r);s) ? so that they can be differentiated with respect to 
marginal score probabilities as 


de Y _ 1 8F(x; r) 
drj G' drj 

3e y _ 1 dG(e Y (x); s) 

ds k G' ds k 


(18) 

(19) 


where 


G' 


8G(e Y (x); s ) 
dy 


(von Davier et al., 2004b). 
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Derivatives for the equipercentile equating functions. The derivatives of the continuized 
percentile-rank cdf in (11) at score x with respect to each xy marginal score probability requires 
that xj be located in the range of X such that x, - Vi < x < xj + Vi. Then 

dF(x\ r) 

dVj, 1 J <J ’ 

= (x- (Xj - 'A )) j’=j, 


= 0 


j’>f (20) 


8G(e fx); s ) 

The expression of---is similar. The next section of this paper gives a modification of 

ds„ 

(20) when raw and sparse data are used. 

The derivatives of the equated scores with respect to the actual Y scores are: 


8G(e y (x);s) 

8y 


~ s vu for e y ( x ) where T„ 1 / 2 < e y (x) < y u +1 /2 


= ^ for e y ^ where e y( x ) T« 1/2 (21) 

The averaging of two probabilities (i.e., derivatives) when the traditional equipercentile 

e y (x) 

is at a +/- 1/2 score boundary is a pragmatic proposal. It is not strictly correct because the 
delta method is intended for smooth and continuously differentiable functions. The percentile- 
rank derivative with respect to score probabilities is not continuous at +/- 1/2 score boundaries; it 
has two derivatives that depend on whether the rate of change in the percentile-rank function is 
approached from the upper or lower score around the score boundary. The precise way of dealing 
with equated scores that are exactly on score boundaries is to treat the derivative and resulting 
standard error estimate as undefined. The actual situations where equated scores are at score 
boundaries are rare except at the tails of score distributions, where standard error estimates are 
known to be poor for other reasons, namely sparse data (Jarjoura & Kolen, 1985; Liou & Cheng, 
1995). Simulation results described in Moses and Holland (2007) suggest that the use of an 
average of the two possible derivatives for equated scores at +/- 1/2 score boundaries has 
minimally adverse effects on standard error estimates. The averaging of derivatives in (21) is 
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completely avoided in kernel equating functions because kernel cdfs are smooth and continuously 
differentiable across the entire score range. 

Raw and sparse data. Equations (20) and (21) must be adjusted when frequency-averaging 
is done for sparse, marginal probabilities and traditional equipercentile equating. When 
frequency-averaging is used, the equating function is not based directly on the marginal 
probabilities. When the percentile-rank function, F(x;r) ? j s computed based on averaged 
marginal probabilities, it can be expressed as F ( x > r ), where r is the vector of original marginal 
probabilities where some are averaged. For example, when the probabilities at scores x i and x 6 

are zero, they are averaged with the probabilities at scores x i and x s and x s and x i , respectively. 
Then the percentile ra nk at score 6 can be expressed as 

F( x 6 = 5;r) = r l + r 2 +r 3 +r 4 +r 5 +r 6 ( 5 —(5 — 1/2)) # 


= 3 


r i+r 2 + r , 


+ r A + 


r 5 +r 6 +r 7 


+ 


r 5 +r 6 +r 7 


(5 - (5 -1 / 2)) 


= i\+r 2 + r 2 +r 4 + ^s +/ ^ +/7 j (l + 5 - (5 -1/2)) 
= r x + r 2 +r 3 + r 4 + ^ 5 + ^ 6 + ' 7 j (6 — (5 — 1/2)) 

= n +r 2 +r 3 + r 4 +( /S + 3 +/? ] (^ 7 -(^e -1/2)) 

= r x + r 2 +r 7 +r 4 + (x 6 - (x 5 -! / 2)). 


dF(x; r) 

Therefore, =1 j’ <j, 

= (x - (xj - l/2))( 1/b) f or any j ’ for which r j is averaged with 

r 3, 

= 0 7 ’ >7 + b - 1. 

( 22 ) 
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In (22), Xj is the lowest of the scores with averaged probabilities at x, and b is the number of score 
values with probabilities that are averaged together. Note that, when several scores have 
probabilities that are averaged together, b gets large, marginal probability distributions become 
uniform, equating functions approach the identity function, and the standard errors go to zero. 

Derivatives for the kernel equating functions. With respect to marginal score probabilities 
and scores, the derivatives of the continuized kernel cdf in (12) are given in von Davier et al. 
(2004b)as 


8F ^ r \^ ( R jAx ;r))-M jA x-.r) dF(x ’ r) 


dr. 


dx 


dF(x;r) ^ 1 

and 5 .. ^ r j^(^j X (-*-’ 1 *)) 


dx 


a x h x 


(23) 

(24) 


where Rjx (* j r ) 


x-a x x J -(\-a x ){i x 

a x^x 


M jX (x;r) 


1 , ( X - U y \ 

j( x - —--J + (1 -a x )x j} 


and ^ denotes the standard Gaussian density function. 

Equating function derivative vectors. When the vectors of score-level equating functions 

are computed, the equating function derivative vectors (i.e., Jacobians) can be merged into the 
that is used in (17). For post-stratification, the two Jacobians for each of the two synthetic score 
distributions are merged: 


f de v 


9 eY(PSE) 


8e v 1 


r)v r)v 

ur wP+(l-w)Q ^ wP+(l-w)Q 


=( J 


-i. 


rwP+(l-w)Q 9 u swP+(l-w)Q 


.)• 


(25) 


For chained equating, the Jacobians based on the four marginal distributions are merged: 


’ eY(CE) 


O) = ff ^r~(e a (x)) j ea w] , J eY (e a (x)) 


da 


7 


(26) 
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where J ea (x) 


( dejx) de/xj )' 

l dr p ’ otp 


(J rP? J t p), 




St, 


and 


de^ = Gll Q (eJx);t Q )/Ga 
da “ GG q (e Y (x); s Q )/Gy ' 


Design function derivatives. The design function for chained equating involves summing 
up the bivariate probabilities to create two marginal probability distributions, one for the test and 
another for the anchor scores. For error estimation, the test and anchor entries of the C-matrix are 
summed in the same way as the bivariate probabilities. For a formal description of the design 
function derivatives for chained equating, refer to the discussion of the single-group design in the 
appendix. 

The design function for post-stratification is nonlinear and can be written in terms of the 
two bivariate and four marginal probabilities used in (9) and (10): 


dr 

Fp, 


- (1 - W )f) 

5p, Sp, ( 




PI 


p, 


I'U 


ds 

dp i 


ds <9p. ( 

- - = w 

Fp, 5p, v 


W 


i 


dr _ dr Fqj 
0q, 5q, oq, 


(1 — w) 


(^/) 1 P. 


J Q 1 


ds 

dq t 


ds 3q, 
3q, 3q, 


l-w + w(^)I K 





(27) 

(28) 

(29) 

(30) 


where the Pi and fii terms denote computations involving the averaged form of the bivariate 
probabilities (Step 2), and the b’s denote the number of score levels on the marginal anchor 
distributions in P or Q that were averaged to avoid divisions-by-zero to compute (9) and (10). For 

smoothed data, Pi and Qi equal Pi and fii and all b’s equal 1. For raw and sparse data, some 
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anchor scores’ probabilities are averaged so that Pi and hi are not always equal to Pi and Oi and 
some b’s can be greater than 1. 

C. The final C in (17) is given as a stacking of the P and Q C-matrices, and assumes that 
the two samples’ probability distributions are estimated independently: 


fC P 0 

10 C Q J • 


Computations for raw data. For raw data, the SEE-vector uses a square C-matrix and can 

become so large that it exceeds storage capacity. Additional multiplications that have a (Je^DF )C 
orientation can result in SEE formulas that avoid the C-matrix in its large form. These are 
described in more detail in the appendix, with the main results presented here. 

For chained equating, the variance of the raw equating function is given as the sum of the 
variances of the equating function derivatives in (25): 

SEE 2 (.v) = [^(e„(x))] 2 SEE;(x) + SEE 2 (e.(x)), (32) 

where SEEfr) =(!/#,)!>, +\ -<Ej r p + (*j,p ))’/>„, 


Z i c)c 

( J s k ■ (Pj s q Pj,q )) Iki s -^-(e a (*)) was defined in ( 26 ), and 

the p’s denote averages of particular derivative vectors. 

For post-stratification equating, the variance of the raw equating function is given as the 
variances of the JeJ D F products from (25) and (27-30): 


SEE 2 (x) = (1/JVp) 


I(ViO> 


11 + (1 /Nq) 




(33) 


where A jl= ( w ip( J rj -A Jr p„) + w(/* Jr r|, + 


B u = w iQ^ J s k -^ 1 q| 1 ) + ( 1 - w )Kp|. + ^ 1 q I .) 


'Ql 
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w,p = w + (1 - w)t Ql /t pl = ( wt Pl + (1 - w)t & )/tp, = t ri /t pl ? 

and w iQ = (\-w) + w(t pl /t Q! ) = t ri /t Ql ' 

SEEDs. Equating functions are often selected based on comparisons with other equating 

functions (e.g., e i y ( x ) ~ e 2y ( x ) ), such as curvilinear and linear differences, chained and post¬ 
stratification differences, post-stratification differences when w is varied, and differences between 
linear methods (von Davier & Kong, 2005). von Davier et al. (2004b) showed that through noting 

o (e, - e 2 ) = J ei _ J e 2 , the SEE-vector fonnula in (17) can be generalized into a SEED-vector for 
the standard error of equating function differences: 

SEED y (x) = ||(J el J DF1 - J« 2 Jdf 2 )c||. (34) 

Two types of SEEDs are important in the NEAT design. One is for evaluating the 
significance of differences between curvilinear and linear kernel equating functions for a given 
design function (either chained or post-stratification). In the kernel framework, curvilinear and 
linear equating functions can be computed simply by selecting smaller and larger continuization 
bandwidths (h ’s ). Because the design function for the curvilinear and linear comparison is the 
same, the SEED can be expressed as 

SEED ; (x) = ||( J„ - J„) J DP C||. (35) 

Another type of SEED is useful for assessing the significance of differences between 
chained and post-stratification equated scores. For this type of comparison, there are separate 
design functions and derivative vectors, so (34) is used. 

Raw SEEDs. The raw SEED analogues of (34) are discussed in detail in the appendix. The 
final results are as follows. For the curvilinear-linear comparison in chained kernel equating, 

SEED 2 (x) - (l/^)SEEDf(x) + (l/A e )SEED^(e a (x)) , (36) 
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where SEEDf(x) = 


^ r del r (el a (x)) /t de2 r (e2 a (x)) /t y 

SEED 2 (e a (x)) = Z( J si k ' J s2 k + J ti, - J t2, '^(J sl -J s2 )Q ' ^(J t i-J t 2 )Q) q u. 

k,l 

For the curvilinear-linear comparison in post-stratification kernel equating, 


SEEDXt) = (MNp) 


A(el-e2)P ^ P 


il + (1 /Nq) 


Z(B (el-e2)kl " fiB(el-e2)Q ) k 


(37) 


where A ( ei-e 2 )ji= l w ir^ J '' 1 i Jr2 j ^ (JrI _ Jr2)P 11 ) + w (A ( j r ,-j r2)P 11 + A (Jsl -j s2)Q 11 )), , 

Upi 


and B 


(el-e 2 )kl 


= ( W ig(J S l k J S 2 k ^(J, 1 -J c )Q|l) + (^ W ^P\3 tl -J tl )V\\ + A(j sl -J s2) Q|l)) , . 

U Q1 


The raw SEED for assessing differences between chained and post-stratification equating 
functions is computed as 


SEED v (x) = JSEED 2 P (x) + SEED; 0 (x) 


yQ * 


whereSEED yP (x) ^ ^ (J rj i,p + 'fij rl p _ fij tl p) Ajup+ZCvp )pji, (38) 


p ji 


da 


SEED 2 Q (x) = -^^(J Stl Q + J M Q -fi JslQ -fij tlQ -B^p +// A2q ) 2 ? h , e i is the chained equating 

N Q kl 


function, and e i is the post-stratification equating function. 


Discussion 

This paper describes how raw data and traditional equipercentile equating functions can be 
incorporated within von Davier et al.'s (2004b) kernel equating framework. Previous studies of 
theoretical standard errors of equating (Jarjoura & Kolen, 1985; Liou & Cheng, 1995; Liou et ah, 
1997; Lord, 1982) addressed many issues discussed in this paper, but in a somewhat disjointed 
manner. This paper’s framework provides some additional structure to allow these previous issues 
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to be more easily related. It also provides a more general standard error of equating difference for 
post-stratification and chained equated scores. 

This paper reveals many issues of equating methodology that deserve further study. More 
general forms of the kernel and linear interpolation continuization methods could be explored, 
such as the utilization of a uniform kernel that is not limited to discrete score intervals and the 
utilization of the suggested frequency averaging rule and linear interpolation over specifically 
defined score ranges. A revisiting of previously proposed rules for dealing with scores of zero 
frequencies in raw post-stratification equating (Jarjoura & Kolen, 1985; Hanson, 1991; Hanson et 
ah, 1994; Kolen & Brennan, 2004; Livingston, 2004) and the implications of these rules on the 
accuracies of equating functions and variability is also warranted. The empirical evaluation of the 
described procedures and their relationships is another important issue that is addressed in an 
accompanying research report (Moses & Holland, 2007). 
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Appendix 

Deriving the Standard Errors of Equating for Raw Data 

This appendix gives the derivations for efficient raw SEE and SEED computations that 
avoid directly forming JL by JL and KL by KL C-matrices. First a basic result is given for the raw 
C-matrix and the product of anything multiplied by the raw C-matrix. Then the derivations of 

(Je^DF )C 

are given for the single-group design (X-to-A in P). The single-group results are then 
easily applied to chained equating. Then the (J^df )C 

is described for post-stratification. Finally, 

the SEED equations are given. 


The Basic Result 

In getting to the SEE-vector for raw data, we first get the row vector J e Jnr and then 
premultiply it into C of the form in (8). Let v denote a row vector of the appropriate dimension 
for the multiplication to make sense, then 

v C = v (l/V^XD^ - Pa/p*”) = (1/V^XvD^ - vpV7). (Al) 

Since p is a column vector, vp is the mean of the elements of v using the probabilities in p, so 
denote this by p v[ >. Furthermore, vD v /p is a row vector with coordinates . Thus, vC is the 

row vector with elements 

(VylN)(yj -\i vT )y[pj. (A2) 

Finally, the squared length of vC will be the sum of squares of the elements in (A2) or 1 IN times 
the variance of the elements of the vector v, that is, 

||v C|| 2 = (1/A)a^ p . (A3) 

As this paper will show, the SEEs for the various designs when there is no presmoothing will 
always involve one or more quantities of the fonn (A3). 
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The Single-Group (SG) Design (X-to-A in P ) 

Following von Davier et al. (2004b), the array of interest is P = (l/7V)n, the two-way table 
of bivariate cell proportions. Furthermore, P is regarded as vectorized into a column vector 
denoted as p. If P is ,/by L, then p is JL by 1. It is now p that is assumed to have the multinomial 
distribution. 

Continuing to follow von Davier et al., the vector of raw score proportions for the two 
tests, X and A, that is, (r l , t 1 ) 1 , is computed as, 



fr> 


(M) 

(r l , t‘)‘ = 

It; 

= 

L Oy 


P, 


(A4) 


where M and N are defined by, 
M = (Ij,. . . Ij), 

f l\ 0 ... (T 

o i; ... o 
o= . . , 

,0 0... IJ, 


(A5) 


(A6) 


and where Ij denotes a J by J identity matrix and Ij denotes a row J-vector of all I’s. M is./ by 
JL and O has L rows and is L by JL. Hence, by definition, 


Jdf - 


"ivT 

,oJ’ 


(A7) 


aJ + LbyJL matrix. 

The C-matrix for this case is exactly as before, 

c = (i/Viv)(D # -pVp 7 ). 

But p is now a JL-vector, partitioned as, 
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( Pl^ 


vp l 9 


where, pi, a column ./-vector, is the I th column of P. 
To compute J e ( X ) Jdf = (J r , Jt) Jdf, observe that 

Oy 


(J r , Jt) Jdf (Jr? J.) 


— J r M + Jt O. 


Then, 


J r M = J r (Ij,. . •, Ij) = (J r , • • •, J r ), 


(A8) 


and 


Jt O = J, 


T o 
o 1: 


0 0 


=( j t , ii) 




(A9) 


Hence, the (j, l) th element of J r M + J t O is 


J , + J t ,. 


(A10) 


Thus, applying (A2), the (j, l) th element of the SEE-vector for the SG design is 


A)(J rj +J t| E(j r +j t )p 


jl ’ 


(All) 


where. 


^(Jr+J^P - X( J r + K)Pj 


j.l 


so that the SEE for the SG design is given by the square root of 


SEE 2 (x) = (1/AT) a (Jr+J,>r- 


(A 12) 
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The NEAT Design—Chain Equating 

Chain equating (CE) uses the results of two embedded single-group designs. In the NEAT 
design there are two bivariate arrays of probabilities, P and Q, where P is Jby L and Q is K by L. 
Again, it is assumed that they have been vectorized into p and q, respectively, and again the 
sample and population proportions are mixed. In CE, the quantities that are used are the four 
probability vectors r P , t P , t Q , sq, defined in von Davier et al (2004b). In both this and the next 
section, j is used for the coordinate subscript for r, k for the subscript for s and 1 for the subscript 
for t, that is, r h s k and ti. 

For CE, the equating function, ey(x; r P , t P , tQ, Sq) is the composition of two simpler 
functions, i.e., 

ey{x; r P , t P , tQ, sq) = ey(e a (x ; r P , t p ); tQ, sq) 

where e A (x; r P , t P ) is the single-group link from X to A and ey{a: t Q , s Q ) is the single-group link 
from A to Y. Then, as shown in von Davier et al. (p. 84), the SEE 2 (x) for CE is the sum of two 
parts corresponding to the two SG links, that is, 

(~)p 

SEE 2 (x) = ( e a ( x ))] 2 SEE 2 (x) + SEEy (e a (x)) _ (A13) 

Now apply (A 12): 

sEEfr) .(i/ W Z(i, +- 1 ., -Kp + t*j,p y>p t -(„ W Z( J , +- 1 ., -t*(j r +j,)p) ! ^ 

j, i j.i 

(A 14) 
and 

SEE*(e„(x)) = (l/jV s )XX +J t| -O'j.q +(*j,q))V = 

k, l 

(\INq)Tj ( J s k + J t, - E(j s+ j,)q) q k ,. (A 15) 

k,l 

In (A 15), the Jacobians and their mean values need to be evaluated at the converted x-scores via 
e a (pc; r P , t P ). The SEE for CE in the NEAT design is the square root of the quantity in (A 13). 
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The NEAT Design—Post-Stratification Equating 

The use of the data from the NEAT design for post-stratification equating assumes that the 
arrays, P and Q, have been vectorized into p and q. The C-matrix for the vector of unsmoothed 
and unaveraged proportions (p\ q 1 ) 1 is given by 


C = 




(A 16) 


where 

Cp= (1 /V^XD^-pVp 7 ) 
and 



Furthermore, the Jacobian of the design function for post-stratification equating in the 
NEAT design is given by 


Jdf - 


' dr 

dp 

ds 

vdp 


dr N 

dq 

ds 

dq> 


(A 17) 


and, as before, 

Je — (Jr? Js)- 

Hence, 

J e J DF = (A,B), (A 18) 

where 


dr 


A = J r — 

dp 



(A 19) 
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(A20) 


dr 0s 


B — J r „ + J, 


0q 0q ’ 


Using (A 16) and (A 18), 


SEE-vector = (A, B) 


( C p 0 ^ 

V° C q2 


= (A C P , B C Q ). 


(A21) 


2 2 

From (A21), the squared length of the SEE-vector in this case will be ||ACp|| + ||BCq|| , and from 
(A3) each of these components will be 1 IN times the variance of the components of A or B. A 
notation can be used that is like the notation used earlier: 


EE 2 (x) = ( 1/Np) C>; P + ( 1/Nq) vl Q , 


(A22) 


where CJ A p denotes the variance of the elements of A with respect to the bivariate distribution P, 

2 

and o B q denotes the variance of the elements of B with respect to the bivariate distribution Q. 

Hence, all that remains is to work out what the elements of A and B are. This is more 
complicated than the previous cases because of the form of the Jacobian of the design function for 
post-stratification equating in the NEAT design. 

To proceed further it may be useful to get the dimensions of the arrays in (A 18) to (A20) 
clear. A is 1 by JL , and B is 1 by KL, so they are the same shapes as p l and q l , respectively. More 
importantly, A and B may be partitioned L blocks in the same was that p and q are, that is 

A = (Ai,. . . , Ai,. . . , A l ), 

B = (Bi, . . . , Bi,. . . , B l ). 

Next, the expressions for the elements of the partitions for A and B are obtained. These are: 

dr 0s T dr 3pj _ T 0s Sp, 

Ai Jr-, + J S - r -— -\ s —— 

0Pl 0pj 0Pj CPj 0pj Cp, 
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( T dr 0s ^ 3p, 


(A23) 


0r 0s dr dq x 0s dq x 

®1 = JrT +Js2^= J r a _ ~ +J s a _ ~ 


0qi s 0qj 0qj dq x 


dq x dq { 


( T dr 0s ^ dq, 

= l r fif + ’ 3ifJ dqf 


(A24) 


In (A23) and (A24), the bivariate probability vectors with averaged probabilities (P and 0) are 
distinguished from the original bivariate probability vectors (p and q). This distinction is 
important, because the raw post-stratification equating function, synthetic marginal probabilities 
(r and s), and algebraic development of J e Jdf that follows are based on P and *1, while the final 
computation of (J^df )C 

utilizes a C that is based on p and q. Additional multiplications of 
J e JDF are needed for derivatives that are in tenns of the original bivariate probabilities. They are 

Sp, _ J / \ 1 ^ _ 1 , 

of the form a - <7 (Pi + Pi+i.+j, I - , , where Opi is the number of anchor score levels 

0p, 'b n ) b PI ’ 

with marginal probabilities that are averaged together. 

dr 

von Davier et al. (2004b, p. 76) provide formulas for the matrices, 77^, and so on, that 

0 Pl 

can be exploited for the J e Jdf of post-stratification. These are: 




(A25) 


= w[(%r' qjlj 


(A26) 


(i-wx^r'pji^ 


(A27) 
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0S 


(A28) 


= Wi^K-wCy^X^r'qiK. 

0qj 

In (A25) to (A28), the quantities w ip and w iq are defined as 

w,p = w + (1 - w)t & / t pi = (wt pi + (1 - w)t & )/t Pl = t Tl /t m ^ 
and w iq = ( ] - w ) + w ( t pi /t e,) = t n /t e,.. 

Next, 

By . _ , 

J r 2 — = w lP J r -(l -w){t dl /t p ,)[(t- pl ) J r p,]l j? 

0Pl 

0S 

j s — = 

0Pl 

and 

By 

j — = (1 - w)[(Q ; ) _1 J r Pj]P K 

0qi 

0S 

J s f— = w iq J s - w (^; / %)[(^)' lj s q 1 ]lK. 

0q L 

Quantities of the form (t Pl ) J, P| occur in each of the above expressions so it is useful to 
have a separate notation for them and in fact they are all conditional means. They are defined as 

Aj_.pi, = E J r, (P/hi) = (Q / r'J r p 1 , (A31) 

Aj s q|i = = {tgi) J s q,. (A32) 

Because { PjJh ,} is the conditional probability distribution of X given A in P, it is clear that 
Aj r p|i is the conditional mean of J r given A over P. A similar interpretation holds for Aj s q|, , thus: 


(A29) 

(A30) 
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= w n>J r "(1 " H ’)^g/^/v)Mj r P|i Vi, 


(A3 3) 


L/CJ 

s — = w V>m l i, 


(A34) 


J r — = (^^KpiiIk, 


(A3 5) 


0s 

J s ~ 

dq { 


W 1Q^S ' W (t p i /?g; )Aj s Q 111 K . 


(A3 6) 


Hence, Ai and Bi in (A23) and (A24) can be examined in more detail: 


( r dr ds^ldp, i . .. 1 

Ai= ^ J r — + J s — J~^~= l w ip J r-( 1 - w )(^«Kp|i 1 J +w ^.Qii 1 jJ^, (A3 7) 


( _ 0r 0s ^ <3q. 


(( ]1 - w )A Jr P |! Ik + w,Q j, - Mip/tg, )A Js q,! Ik ) T^. 

Uni 


(A3 8) 


Equations (A29) and (A30) give the following relationships, 


(1 - w)t Ql /t rn = w S -W' and w(t pi /t 0l ) = w,q -(1 -w) 


(A3 9) 


Equations (A37) and (A38) are simplified using (A39) as follows: 


( _ 0r 0s ^ dp, 

A '=l Jr ^ + 


W lP J r “(Wjp - wV/jpuIj + WfU jM lj J — 

Dpi 


(A40) 
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Bi = I J 


( _ 0r 0s ^ da, 

l J ^ + ' 


dq,) dq l 


= (!->v)// Jr p|il^+w lQ J s -(w 1(5 -(l->v))// j5Q|1 lU| — .(A41) 


-1' ) W 

w 0( b J- 


Hence, 


( _ 0r 0s ^ 0p, 


w ip(J r /^j r p|i^j) + w (^j r p|i + ^j,qii)^j ) i , (A42) 


and 


( _ 0r 0s 0q, 

Bl= l 'Sff + ’ fif) fiT 


= [ W 1 Q (J S Aj 5 Q|i1k) + (1 W )(Aj r p|i + Aj s q|i)1k) , . (A43) 

°Q 1 


All that remains is to give the individual coordinates of the vector partitions, A] and Bi. These are: 


Aji = ( W lp(Jij Aj r p|i) + W (Aj p p|i + Aj s Q|i) 


and Bki = \ w iq^\ Aj sQ |i) + (l 


^qi 


(A44) 

(A45) 


Finally, we use (A44) and (A45) to obtain the two variances in (A22) to get the SEE for 
the post-stratification method of equating in the NEAT design. 

The NEAT Design—Standard Errors of Equating Differences (SEEDs) 
Curvilinear-Linear SEEDs 

For kernel equating functions, curvilinear and linear equating functions for a given method 
(either post-stratification or chained) can be computed by varying the continuization bandwidths 
(h ’s). The derivative of the (ei - ei) difference is simply the difference between the equating 

functions’ derivative vectors, ^( e i ~ e i) ~ J e i - J e 2 , so that, 

SEED y (x) = ||(J el - J e2 )J D F^-| = I J e l-e2(A46) 
(von Davier et al., 2004b, p. 80). 
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The implication of (A46) is that raw curvilinear-linear SEEDs can be computed by 

utilizing all of the raw SEE results with the substitution of J e i " ^2 for J e . 

The SEED analogues of (Al) and (A2) are: 

SEED - Vector, (x) = -^=|(J el - J e2 )J DF D^ — (Jei " Je 2 )JDFP'\/p _ j, and (A47) 

SE ED y (x) = ^ ((Jell - J e2 j )J D F “ ^u eX -j el )j DF ) Pj . (A48) 

Equation (A48) makes it clear that the raw SEED is the standard deviation of the 
difference between equating functions’ derivatives for a given design and data set. 

To compute the raw SEEDs for kernel post-stratification, simply use J r i " J r 2 and J s i " J S 2 

in place of J r and J s in (A22), (A31-A45). The curvilinear-linear post-stratification SEED 
elements are: 


A, 


(el-e2)jl ' 


( W lp(Jrlj Jr2j A(J rl -J r2 )P 11) + W (A(J rl -J r2 )P 11 + A(J s , -j s2 )Q 11)) . 


and B(el-e 2 )kl = \ W !Q^ sl k ' ls2 t A ( j s1 _j s2 ) Q |i) + (1 W ’)^(J rl -J r2 )P|l + A(J sl -J s2 )Q|l)) k . 


Qi 


(A49) 

(A50) 


The raw SEED for chained kernel requires a little more than the simple substitution of 

8Cy 

Je\ ~ Jel for J e . There are two “^“( e a C*)) ’s that are needed, and these must be incorporated into 
specific parts of the Ai ~ Jei computations: 


SEED 2 (x) = (l/^)SEEDf(x) + (l/A e )SEED 2 (e a (x)) ? 
where SEED 2 (x) = 


(A51) 


^ del y (el a (x)) 8e2 r (e2 a (x)) 

2—1 V „ 1 rl; ' fil, 1 


J,1 


da 


da 


(J l2 +j„,)-n a, . -f *. Se2_ _Y Pn (A 52 ) 




SEED 2 (e fl (x)) = Z( J si k ” J s2 k + J ti, “ J t2, "fi(J sl -J s2 )Q '^(J tl -J t2 )Q^ 


<Ih 


k,l 


(A53) 
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Chained-Post-Stratification SEEDs 

The SEED vector can also be expressed more generally as 

SEED y (x) = ||(J el J 

DFl " ^e2 JdF2 ) C IL (A54) 

Equation (A54) can be used to estimate the variability of the difference between equating 
functions based on different design functions, such as two post-stratification equating functions 
where w is varied, or between chained (el) and post-stratification (e2) equating functions (von 
Davier et al., 2004b p. 81). 

Based on the definition of C in (3), the raw SEED in (A54) can be computed as follows: 
SEED y (x) = 


(J 


elP^DFlP " Je2pJDF2P 


)<V+ (j 


cIqJdFIQ " Je2QJDF2Q 


)C ( 


(A55) 


From (A55), there are two major parts to the SEED: 


SEED y (x) = ^SEEDjp(x) + SEED^(x) 


(A56) 


which are now separately simplified. 
From (Al), 


SEED yP (x) = 


~N~p 


(JelP^DFlP “ Je2pJoF2p) D (JelP^DFlP " Je2P JdF2p)pVp _ 


SEED; 0 (x) = -^ 


N n 


(JelO^DFlO ” Jc2qJdF2q) D ^ (JclQ^DFlO ” J e 2Q ^ DF2Q )l 


(A57) 

(A58) 


Factoring out all Vp' ’s and \jq ’s and noting that products of row and column vectors are 


sums, 


1 

SEED y p(x) = —— ( J e ipJ D FlP ” Je2pJDF2P ” (^elP ^DFIP ” Je2P JdF2P )p) P . 

ly P 

SEED yQ (x) = (JcIqJdFIQ ” Jc 2 qJdF 2 Q “(JelO^DFlO ” ^e 2 Q JdF 2 Q )l) 1 . 

■L' r 


From (A2), the elements multiplied by p and q vectors are means: 


(A59) 

(A60) 
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2 1 ( 

SEED yP (x) = J e ipJ D FlP " *^e2P t ^DF2P ~ (/f/ e7 r J, 


1 


- u, , 

IP^DFIP r J e2P J DF2P 


))" p 


SEED q(x) JclQ JdFIQ " Je20 JdF20 (Mj 


' y Q w yy («.1Q«DF1Q ** t2Q u DF2Q ^J eIQ J DFIQ VJ e : Q J D F2Q 


2 

)) q 


•(A61) 

(A62) 


From (25), (26), (A 13), (A 14), (A 15), (A44), and (A45), 


\2 _ 1 (deI Y (eI a (x)) 


SEED' (x) = —— 


yP v ' N p \ da 


(J r i,p ~ l " Jti,p " Ej rl p ■ Ej tl p) A 2P + // A d p 


SEED yQ (x) ( J s i,q + ^ti.Q " Ej s1 q " Ej tl Q ^ap^/^Ajp) 1. 


(A63) 

(A64) 


Then the elements of (A56) can be computed as 


SEED ■ W 4£(W W . 


N r ^ 


da 


SEED yQ (x) ^(^s t i,Q + ^t,i,Q "Ej s1 q "Ej„q ^ki,2 P + /^ 

yv e o 


M'Jjjp) Ajl,2P + Ma 2? ) Pjl. 

(A65) 

+ / / A 2Q ) %l . 

(A66) 
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